Learning an activity-based semantic scene model

نویسنده

  • Dimitrios Makris
چکیده

This thesis investigates how scene activity, which is observed by fixed surveillance cameras, can be modelled and learnt. Modelling of activity is performed through a spatio-probabilistic scene model that contains semantics like entry/exit zones, paths, junctions, routes and stop zones. The spatial nature of the model allows physical and semantic representation of the scene features, which can be useful in applications like video annotation and contextual databases. The probabilistic nature of the model encodes the variance and the related uncertainty of the usage of the scene features, which is useful for activity analysis applications, such as motion prediction and atypical motion detection. A variety of models and learning methods are used to represent and automatically derive particular activity-based semantic scene elements. Expectation-Maximisation is used for learning Gaussian Mixture Models and accumulative statistics in image maps are integrated in the methods presented. Also, a novel route model and an appropriate learning algorithm are introduced. Additionally, a Hidden Markov Model superimposed on the scene model is used for enabling activity analysis. The application of the methods is investigated for single cameras and collectively across multiple cameras. Additionally, a novel automatic cross-correlation method is introduced that reveals the topology of a network of activities, as observed by a network of uncalibrated cameras. The method is important not only because it provides an integrated activity model for all the cameras, but also because it provides a mechanism to automatically estimate the topology of the camera network, modelling the activity across the “blind” areas of the surveillance system. All the proposed learning algorithms are unsupervised to allow automatic learning of the scene model. Their input is a set of noisy trajectories derived automatically by motion tracking modules, attached to each of the cameras. 9 ________________________________________________________________________ List of Abbreviations AHMM: Abstract Hidden Markov Model BBN: Bayesian Belief Network blob: Binary Large OBject CCTV: Closed Circuit TeleVision CHMM: Coupled Hidden Markov Model CUES: City University Experimental Surveillance DBN: Dynamic Belief Network DDN: Dynamic Decision Network DPN: Dynamic Probabilistic Network EM: Expectation Maximisation FLTC: Fuzzy Logic Trajectory Classifier FOV: Field Of View GM Gaussian Model GMM: Gaussian Mixture Model GP: Ground Plane GPC: Ground Plane Constraint HHMM: Hierarchical Hidden Markov Model HMM: Hidden Markov Model ICA: Independent Component Analysis LMS: Least Median of Squares MCAN: Multiple Camera Activity Network MDL: Minimum Description Length ML: Maximum Likelihood MLTC: Maximum Likelihood Trajectory Classifier NIHC: Numeric Iterative Hierarchical Cluster NN: Neural Network PCA: Principal Component Analysis pdf: probability distribution function PDM: Point Distribution Model RBHMM Route-Based Hidden Markov Model SVM: Support Vector Machine TCM: Tracking Correspondence Model VLHMM Variable Length Hidden Markov Model VMD: Video Motion Detection VQ: Vector Quantisation Chapter 1: Introduction 10 ________________________________________________________________________

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Automatic Learning of an Activity-Based Semantic Scene Model

This paper proposes an activity-based semantic model for a scene under visual surveillance. It illustrates methods that allow unsupervised learning of the model, from trajectory data derived from automatic visual surveillance cameras. Results are shown for each method. Finally, the benefits of such a model in a visual surveillance system are discussed.

متن کامل

بازیابی تعاملی تصاویر طبیعت با بهره گیری از یادگیری چند نمونه ای

Content-based image retrieval (CBIR) has received considerable research interest in the recent years. The basic problem in CBIR is the semantic gap between the high-level image semantics and the low-level image features. Region-based image retrieval and learning from user interaction through relevance feedback are two main approaches to solving this problem. Recently, the research in integra...

متن کامل

Presentation of an efficient automatic short answer grading model based on combination of pseudo relevance feedback and semantic relatedness measures

Automatic short answer grading (ASAG) is the automated process of assessing answers based on natural language using computation methods and machine learning algorithms. Development of large-scale smart education systems on one hand and the importance of assessment as a key factor in the learning process and its confronted challenges, on the other hand, have significantly increased the need for ...

متن کامل

Presentation of an efficient automatic short answer grading model based on combination of pseudo relevance feedback and semantic relatedness measures

Automatic short answer grading (ASAG) is the automated process of assessing answers based on natural language using computation methods and machine learning algorithms. Development of large-scale smart education systems on one hand and the importance of assessment as a key factor in the learning process and its confronted challenges, on the other hand, have significantly increased the need for ...

متن کامل

A Hybrid Algorithm based on Deep Learning and Restricted Boltzmann Machine for Car Semantic Segmentation from Unmanned Aerial Vehicles (UAVs)-based Thermal Infrared Images

Nowadays, ground vehicle monitoring (GVM) is one of the areas of application in the intelligent traffic control system using image processing methods. In this context, the use of unmanned aerial vehicles based on thermal infrared (UAV-TIR) images is one of the optimal options for GVM due to the suitable spatial resolution, cost-effective and low volume of images. The methods that have been prop...

متن کامل

Compressed Domain Scene Change Detection Based on Transform Units Distribution in High Efficiency Video Coding Standard

Scene change detection plays an important role in a number of video applications, including video indexing, searching, browsing, semantic features extraction, and, in general, pre-processing and post-processing operations. Several scene change detection methods have been proposed in different coding standards. Most of them use fixed thresholds for the similarity metrics to determine if there wa...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2004